# ONNX Format

Qwen3 1.7B ONNX
Qwen3-1.7B is a 1.7B-parameter open-source large language model released by Alibaba Cloud, based on the Transformer architecture, supporting various natural language processing tasks.
Large Language Model Transformers
Q
onnx-community
189
1
Stt Ru Fastconformer Hybrid Large Pc Onnx
NVIDIA FastConformer-Hybrid Large is a Russian automatic speech recognition model based on the FastConformer architecture, supporting CTC and RNN-T decoders.
Speech Recognition
S
istupakov
163
1
Grounding Dino Tiny ONNX
Apache-2.0
A lightweight zero-shot object detection model in ONNX format, compatible with Transformers.js, suitable for browser-side deployment.
Object Detection Transformers
G
onnx-community
98
1
Granite Timeseries Patchtst
IBM Granite series time series forecasting model, based on PatchTST architecture, suitable for various time series forecasting tasks.
Climate Model Transformers
G
onnx-community
182
1
Mediapipe Selfie Segmentation Landscape
Apache-2.0
A lightweight portrait segmentation model in ONNX format, specifically optimized for separating people from backgrounds in landscape images.
Image Segmentation
M
onnx-community
45
3
Timesformer Hr Finetuned K600
TimeSformer-HR is a video action recognition model optimized for high-resolution videos and fine-tuned on the Kinetics-600 dataset.
Video Processing Transformers
T
onnx-community
17
0
Timesformer Base Finetuned Ssv2
TimeSformer is a Transformer-based video understanding model specifically optimized for temporal action recognition tasks.
Video Processing Transformers
T
onnx-community
17
0
Timesformer Base Finetuned K600
TimeSformer is a video understanding model based on the Transformer architecture, specifically designed for video classification tasks.
Video Processing Transformers
T
onnx-community
16
0
Timesformer Base Finetuned K400
TimeSformer is a Transformer-based video understanding model, specifically fine-tuned on the Kinetics-400 dataset.
Video Processing Transformers
T
onnx-community
17
0
Whisper Base.en
Whisper is a general-purpose speech recognition model trained by OpenAI. This model is based on large-scale weakly supervised training and supports speech transcription in multiple languages.
Speech Recognition Transformers
W
onnx-community
76
1
Whisper Base
Whisper is an automatic speech recognition (ASR) system trained by OpenAI, supporting multilingual speech transcription.
Speech Recognition Transformers
W
onnx-community
5,704
19
4x APISR GRL GAN Generator Onnx
Gpl-3.0
GAN-based 4x super-resolution image upscaling model, compatible with Transformers.js
Image Enhancement Transformers
4
Xenova
52
10
Gyr66 Bert Base Chinese Finetuned Ner Onnx
Apache-2.0
This is the ONNX format conversion version of the gyr66/bert-base-chinese-finetuned-ner model, designed for Chinese named entity recognition tasks.
Sequence Labeling Transformers Chinese
G
protectai
171
1
Depth Anything Large Hf
ONNX version of depth estimation model based on Transformers.js, suitable for web applications
3D Vision Transformers
D
Xenova
19
3
Fmops Distilbert Prompt Injection Onnx
Apache-2.0
This is the ONNX format conversion of the fmops/distilbert-prompt-injection model, designed for detecting prompt injection attacks.
Large Language Model Transformers English
F
protectai
23
0
Bert Base NER Onnx
MIT
This is the ONNX format version of the dslim/bert-base-NER model for named entity recognition tasks, capable of identifying four entity types: location, organization, person, and miscellaneous.
Sequence Labeling Transformers Supports Multiple Languages
B
protectai
19.94k
4
Dpt Hybrid Midas
Hybrid depth estimation model developed by Intel, combining the advantages of convolutional neural networks and Transformer architecture
3D Vision Transformers
D
Xenova
23
0
Swin2sr Lightweight X2 64
Lightweight Swin2SR image super-resolution model that can upscale image resolution by 2 times
Image Enhancement Transformers
S
Xenova
21
0
Swin2sr Classical Sr X2 64
A classical image super-resolution model based on Swin2SR architecture, capable of upscaling image resolution by 2 times
Image Enhancement Transformers
S
Xenova
47
3
Trocr Base Handwritten
A Transformer-based handwritten text recognition model that converts handwritten images into text
Image-to-Text Transformers
T
Xenova
74
2
Trocr Small Handwritten
A small Transformer-based handwritten text recognition model optimized for web usage
Text Recognition Transformers
T
Xenova
104
6
Trocr Base Printed
TrOCR is a Transformer-based OCR model specifically designed for recognizing printed text.
Text Recognition Transformers
T
Xenova
40
0
Trocr Small Printed
TrOCR-small-printed is a compact optical character recognition (OCR) model specifically designed for printed text recognition.
Text Recognition Transformers
T
Xenova
79
3
Gbert Large Paraphrase Cosine Onnx
MIT
A German text embedding model based on sentence-transformers, mapping text to a 1024-dimensional vector space, specifically designed to enhance few-shot text classification performance in German
Text Embedding Transformers German
G
blackcodetavern
18
0
Distilbart Cnn 12 6
DistilBART-CNN-12-6 is a distilled version of the BART model, optimized for text summarization tasks, with a smaller size while maintaining high performance.
Text Generation Transformers
D
Xenova
218
0
Yolos Small
YOLOS-small is a small object detection model based on the Transformer architecture, designed for efficient visual tasks.
Object Detection Transformers
Y
Xenova
63
0
Yolos Tiny
YOLOS-tiny is a lightweight object detection model based on the Transformer architecture, suitable for real-time object detection tasks.
Object Detection Transformers
Y
Xenova
1,912
5
Tinystories 1M ONNX
TinyStories-1M-ONNX is a small language model based on the ONNX format, suitable for text generation tasks.
Large Language Model Transformers English
T
mkly
63
2
E5 Small V2
E5-small-v2 is an efficient text embedding model suitable for various natural language processing tasks.
Text Embedding Transformers
E
Supabase
35
2
Wav2vec2 Base 960h
ONNX format conversion of Facebook's wav2vec2-base-960h model, designed for Transformers.js, supporting browser-side speech recognition
Speech Recognition Transformers
W
Xenova
117
3
Mms Lid 4017
MMS-LID-4017 is a speech recognition model supporting 4017 languages, developed by Facebook, focusing on language identification tasks.
Text Classification Transformers
M
Xenova
15
1
Mms Lid 126
MMS-LID-126 is a multilingual speech recognition model released by Facebook, supporting recognition of 126 languages.
Text Classification Transformers
M
Xenova
14
0
Ast Finetuned Speech Commands V2
A voice command recognition model based on AST architecture, optimized for web deployment in ONNX format
Audio Classification Transformers
A
Xenova
15
0
Ast Finetuned Audioset 10 10 0.4593
Audio Spectrogram Transformer (AST) model fine-tuned on the AudioSet dataset for audio classification tasks
Audio Classification Transformers
A
Xenova
82
0
Whisper Medium
Whisper Medium is a medium-scale speech recognition model developed by OpenAI, supporting automatic speech recognition (ASR) tasks in multiple languages.
Speech Recognition Transformers
W
Xenova
871
4
Detr Resnet 101
End-to-end object detection model based on Transformer architecture with ResNet-101 feature extractor
Object Detection Transformers
D
Xenova
216
2
Whisper Small
Whisper Small is a small automatic speech recognition (ASR) model developed by OpenAI, capable of converting speech into text.
Speech Recognition Transformers
W
Xenova
1,716
9
Whisper Base
Whisper is an automatic speech recognition (ASR) system trained by OpenAI, supporting speech-to-text tasks in multiple languages.
Speech Recognition Transformers
W
Xenova
6,204
7
Whisper Tiny
Whisper Tiny is a lightweight speech recognition model open-sourced by OpenAI, suitable for web deployment.
Speech Recognition Transformers
W
Xenova
21.70k
8
Bart Large Cnn
A large-scale text summarization model based on the BART architecture, optimized for the CNN/DailyMail dataset
Text Generation Transformers
B
Xenova
173
8
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase